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Abstract 



We give a short proof that any comparison-based {n^ '^)-approxiniation algorithm for the 
1-dimensional Traveling Salesman Problem (TSP) requires fl{nlogn) comparisons. 

Introduction 

In September of 2012 David Eppstein asked the fohowing question on lcstheory.stackexchange.com 



"The one-dimensional traveling salesperson path problem is, obviously, the same thing 
as sorting, and so can be solved exactly by comparisons in 0(n log n) time, but it is 
formulated in such a way that approximation as well as exact solution makes sense. In 
a model of computation in which the inputs are real numbers, and rounding to integers 
is possible, it's easy to approximate to within a 1 -|- 0{n~'^) factor, for any constant c, 
in time 0{n): find the min and max, round everything to a number within distance 
(max — min)n^^^^^^ of its original value, and then use radix sort. But models with 
rounding have problematic complexity theory and this led me to wonder, what about 
weaker models of computation? 

So, how accurately can the one-dimensional TSP be approximated, in a linear compari- 
son tree model of computation (each comparison node tests the sign of a linear function 
of the input values), by an algorithm whose time complexity is o(n log n)? The same 
rounding method allows any approximation ratio of the form n^~°^^^ to be achieved (by 
using binary searches to do the rounding, and rounding much more coarsely to make it 
fast enough). But is it possible to achieve even an approximation ratio like 0{n^^^) for 
some e > 0?" [2] 

The purpose of this short note is to document, in a citable form, a negative answer. 

Theorem 1 For any constant e > 0, every comparison-based n}~'' -approximation algorithm for 
1-dimensional TSP requires O(enlogn) comparisons in the worst case. 

A "comparison-based" algorithm is one that queries the input only with binary (True/False) queries. 

Note: after the initial draft of this note, it was pointed out to me that Das, Kapoor and Smid 
proved the following result in 1997: 
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Theorem 2 ([T], Thm. 1) Let d > 1 he an integer constant. In the algebraic computation-tree 
model, any algorithm that, given a set S of n points in and a sufficiently large real number 
r < n, computes an r-approximate TSP-tour for S, takes Q{nlog{n/r)) time in the worst case. 

They also give related results for minimum-spanning trees. 

Note that, by taking r = n^"*^, their result essentially implies Thm. [TJ The proof in [Ij uses a 
Ben-Or-style proof, different from the direct proof here. 

Proof of Theorem [1] 

Fix any constant e > and an arbitrarily large positive integer n. 

Consider just the n! "permutation" input instances {xi, X2, . . . , Xn) that are permutations of 
[n] = {1, 2, . . . , n}. The optimum solution for any such instance has cost n — 1. 

Define the cost of a permutation tt to be c(7r) = \7r{i + 1) — vr(z)|. Model the algorithm as 
taking as input a permutation vr, returning some permutation vr', and paying cost d{'ir, vr') = c{tt'ott). 

Define C = C{n,e) to be the minimum number of comparisons for any comparison-based 
algorithm to achieve competitive ratio n^"*^ on these instances. Since opt is n — 1, the algorithm 
must guarantee cost at most n^"*^. We will show C > Q{enlogn) — 0{n). 

Define P = P{n,e) to be, for any possible output vr', the fraction of possible inputs for which 
output vr' would achieve cost at most n^~^. This fraction is independent of tt'. 

P also equals the probability that, for a random permutation vr, its cost c(7r) is at most n^"*". 
(To see why, take tt' to be the identity permutation /. Then P is the fraction of inputs for which 
d{ir,I) at most n^~^, but d{'K,I) = c(7r).) 

Lemma 1 C(n,e) > log2 1/P(n, e). 

Proof. Fix any algorithm that uses less than log2 1/-P comparisons on each of the inputs. The 
decision tree for the algorithm has depth less than log2 so there are less than 1/P leaves, and, 
for some output permutation vr', the algorithm gives vr' as output for more than a P fraction of the 
inputs. By definition of P, for at least one such input, the output vr' gives cost more than n^"*^, so 
the algorithm is not a n^~'^-approximation algorithm. □ 

Lemma 2 P{n,e) < exp(— r2(enlogn)). 

Before we prove Lemma EJ note that the two lemmas together prove the theorem: 

1 

C > log2 — = log2 exp(r2(enlogn)) = Q,{enlogn). 
To finish proving the theorem, we prove Lemma [2j 

Let vr be a random permutation. Recall that P equals the probability that its cost c{tt) is at 
most n^"*^. Say that any pair {i,i + 1) is an edge with cost \7r{i + 1) — 7r(i)|, so c(7r) is the sum of 
the edge costs. 

Suppose c(7r) < n^"*^. 

Then, for any g > 0, at most n^~^/g of the edges have cost q or more. Say that edges of cost 
less than q are cheap. 

Fix q = n^""^/^. Substituting and simplifying, at most n^"*^/^ of the edges are not cheap. 
Thus, at least n — n^"*^/^ > n/2 of the edges are cheap. Thus, there is a set S containing n/2 
cheap edges. 
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Claim 1 For any given set S o/n/2 edges, the probability that all edges in S are cheap is at most 
exp{—Q{en logn)). 



Before we prove the claim, note that it implies Lemma[2]as follows. By the claim, and the naive 
union bound, the probability that any there exists such a set S is at most 

exp(— ri(en log n)) < 2" exp(— fifen log n)) 

n/2J 

< exp(0(n) — $7(en log n)) < exp(— il(enlogn)). 

To complete the proof of the lemma and the theorem, we prove the claim. 

Choose TT by the following process. Choose 7r(l) uniformly from [n], then choose 7r(2) uniformly 
from [n] — {7r(l)}, then choose 7r(3) uniformly from [n] — {7r(l), 7r(2)}, etc. 

Consider any edge {i, i + 1) in S. Consider the time just after Tr{i) has been chosen, when 
7r(i + 1) is about to be chosen. Regardless of the first i choices (for 7r(j) for j < i), there are at 
least n — i choices for 7r(i + 1), and at most 2n^^'^/^ of those choices will give the edge (i, i + 1) cost 
less than n^"'^/'^ (making it cheap). 

Thus, conditioned on the first i choices, the probability that the edge is cheap is at most 
min(l, ) . Thus, the probability that all n/2 edges in S are cheap is at most 

T-r / 2ni-^/2 
I I min I 1, 

V n — i 
(j,j+i)e5 

Since \S\ > n/2, there are at least n/4 edges in S with n — i > n/4. Thus, this product is at most 

r>„l-e/2 

< (8n-'/2^"/^ = exp(0(n) -O(enlogn)) = exp(-J](en log n)). 



^ n/4 

This proves the theorem. □ 



References 

[1] Gautam Das, Sanjiv Kapoor, and Michiel Smid. On the complexity of approximating euclidean 
traveling salesman tours and minimum spanning trees. Algorithmica, 19(4):447~462, 1997. 

[2] David Eppstein. Approximate Id tsp with linear comparisons? 

http : / / cstheory . stackexchange . com/ quest ions/ 12593/ approximate- Id- tsp- with- linear- comparisoE 
September 2012. 



3 



